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Abstract 

Power series expansions naturally arise whenever solutions of ordinary differential equations are 
studied in the regime of perturbation theory. In the case of quasi-periodic solutions the issue of 
convergence of the series is plagued of the so-called small divisor problem. In this paper we review a 
method recently introduced to deal with such a problem, based on renormalisation group ideas and 
multiscale techniques. Applications to both quasi-integrable Hamiltonian systems (KAM theory) and 
non-Hamiltonian dissipative systems are discussed. The method is also suited to situations in which 
the perturbation series diverges and a resummation procedure can be envisaged, leading to a solution 
which is not analytic in the perturbation parameter: we consider explicitly examples of solutions 
which are only C°° in the perturbation parameter, or even defined on a Cantor set. 

1 Introduction 

Consider ordinary differential equations of the form 

D e u = eF(u,ut), (1.1) 

where u = (ui, . . . ,u n ) G R™, e £ I and uj G R m are parameters, called respectively the perturbation 
parameter and the frequency vector of the forcing, F : A x T m — > R™ is a real analytic function, with 
T = R/27rZ and A C R™ an open set, and D e is a differential operator possibly depending on e, 

D e = d\° + aiedt 1 + a 2 e 2 dl 2 + . . . , io,*i,*2, • • • € N. (1.2) 

In particular we shall consider explicitly the cases D £ = df and D £ — dt + £<9 t 2 . The case m = is allowed 
and corresponds to F — F(u). If m > 1, we can assume without loss of generality that the vector u has 
rationally independent components. 

We also assume that for e = the unperturbed equation 

D a u = (1.3) 

admits a quasi-periodic solution uq — uo(ojf), with u> G R p , p > 0, possibly trivial (that is p = 0, which 
gives a constant). For instance, if u is an angle, u G T™, and D £ — 9 t 2 , one has uq = cq + £lt, with 
ca,fl G R"; up to a linear change of coordinates, we can always write SI = (u>,0, ... ,0), such that the 
vector u> € R p , p < n, has rationally independent components. If u G R™ and, say, D e — dt + ed 2 , one 
has Mo = Co, where Co G R™ is a constant vector. 

We are interested in quasi-periodic solutions to (jl.l|) with rotation vector (or frequency vector) lj — 
(<jJ,co) G R d , d = p + m, that is solutions of the form u = u(u>t, e), with 

u(^e)= ^n u (s), (1.4) 
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where v ■ ip = viipi + . . . + Vdipd denotes the standard scalar product. For to = we have d = p and 
id = ui, while for p — we have d = to and id — to. A suitable Diophantine condition will be assumed on 
oj, for instance the standard Diophantine condition 

\id ■ v\ > j\v\- T Vi/eZ d \{0}, (1.5) 

where 7 > and r > d — 1 are the Diophantine constant and the Diophantine exponent, respectively, and 
\v\ = \ \ + . . . + \ fd\- Weaker Diophantine conditions could be considered - see Section [12. II 
The operator D e acts as a multiplication operator in Fourier space, that is 

(D e u) v = 5(oj ■ v,s)u u , (1.6) 

with 5(x, e) = So(x)+eS 1 (x)+e 2 52{x) + . . .. For instance, if D e = d\ then 5(w-v, e) = 8q(lq-v) — (ilu-v) 2 , 
if D e — dt + sdf then 6(u> ■ v, e) = ioj ■ v (1 + ieuj ■ v), and so on: we shall explicitly take S(u> ■ v, e) to be a 
polynomial in e. In other words D e can be expanded as 

fco 

D e = J2^ D{k \ D {0) =D , (1.7) 

with ko € N and = akd\ k , so that 5k (id ■ v) = ak(iid ■ v) lh , for 1 < k < fco. The Diophantine condition 
(|1.5p implies 

\5 (cd-y)\ >7o|^r ro , 7o = 7 10 , T =i T. (1.8) 

The problem we address here is to find a quasi-periodic solution u(ut,e) to the full equation p.l[) . 
which continues the unperturbed solution uo(u:t), that is such that it reduces to ito(wi) as e —> 0. This 
means that we look for results holding for small values of the parameter e. Hence (jl.lj) can be seen as a 
perturbation of the equation ()1.3|) . and this explains why e is called the perturbation parameter. 

More precisely we shall be interested in both the existence and stability of such quasi-periodic solu- 
tions. In particular, our analysis accounts for the KAM theory for quasi-integrablc Hamiltonian systems 
(in a special case) and the existence of quasi-periodic attractors for strongly dissipative quasi-periodically 
forced one-dimensional systems. We can also consider discrete systems, as opposite to the continuous 
ones such as (|1.1|) . We shall see that in both cases the existence of a quasi-periodic solution for the 
dynamical system is reduced to existence of a solution for a suitable functional equation - see Section EOl 
for more insight. Extensions to more general systems will be briefly discussed in Section [T2"l 

The method we shall follow uses renormalisation group ideas, and is based on techniques of multiscale 
analysis which are typical of quantum field theory. The method is widely inspired to the original work 
of Eliasson [29] and, even more, to its reinterpretation given by Gallavotti [32]. The deep analogy with 
quantum field theory was stressed and used to full extend in subsequent papers; see for instance [46U47j . 

For other renormalisation group approaches existing in the literature to the same kind of problems 
considered in this review see for instance [6"S1 IT71 IBUl IS"9"] . 



2 Perturbation theory and formal solutions 

As first attempt, we look for quasi-periodic solutions p.4[) to (|1.1[) in the form of formal power series in 
the perturbation parameter e, 

00 

M (^ e ) = 5>V%), /'W^e'"^', fc>l, (2.1) 

k=0 v£l d 

where = uq is such that DqUq = 0. The power series expansion (12. 1H will be referred to as the 
perturbation series for the quasi-periodic solution. Perturbation series have been widely studied in the 
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literature, especially in connection with problems of celestial mechanics [7T]. They are sometimes called 
the Lindstedt series or Lindstedt-Newcomb series, from the name of the astronomers who first studied 
them in a systematical way. 

If we expand also D £ according to (|1.7|) . we obtain to all orders k > 1 



min{fc,fco } 

D oU W=- £> (p) « ( *~ p) + [F(u,ujt)f- 1) , (2.2) 

p=i 

where [F(u, u)t)]( k ' means that we take the Taylor expansion of the function u — » -F(u, •), then we expand 
u in powers of e according to (]2.1|1 , and we keep the coefficient of e k , that is 

[^( W ,^)] (fc) =f;i^( U oM),a;t) Y (2.3) 

s=0 ' fci,...,fc 3 >l 



fci+...+fe s =fe 



which for jfe = reads [F(u,ujf)] {a) = F(u (u>t),ujt). 
If uq = Cq + ll>< (and hence p = n) , we expand 



F{u ,uA) = F(c + wi, ut) = Y e iv °- e °j v °- ut F V0 , (2.4) 



with 



where vq = (vq,Vq), so as to obtain in Fourier space 

min{fc,fco} 

5o(wu)uW=- ^■v)ul k -ri + {F(u,u l t)i k - 1) , (2.5) 

p=i 

[%4?'=Ei E (i^r^c 1 — y «£° •••«£•>. ( 2 -6) 

*=" ' ' v ,vi,...,v r €Z d ki,...,k s >l 
^o+^i+.-.+^s— ^ fei+...+fe B = k 

By the analyticity assumption on the function F one has \F V \ < Soe - ^"! for suitable constants £, So > 0. 
If Mo = Co one has uj = In that case we Fourier expand F{u,uxt) only in the argument ujt, 

F(u,ut)= Y e^'^W, (2.7) 
and we still obtain (12. 511. but with v = v and 



\F{uM)i? = Yt\ E E 4 fc 1 l) ---4! s3 ' ( 2 - 8 ) 



S! 

ilo 1 ■ ■ ■ s ~— fel+...+fcs= k 



More generally, one expands 



d s u F(u ,uit)= Y j^Fs,^ (2-9) 



with coefficients T 8 ^ v bounded as \T 8 , V \ < sEoSfe ^"l for suitable constants £,^0i^i > 0, so that 

[F(u,u,t)]M =Yji Y Y (2-10) 



5=0 v ,v 1 ,...,v s ei i fci,...,fc a >i 

vo+vi + ...+v s =i> ki + ...+k s =k 
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Then, to all order k > 1 one obtains 

/ min{fc,fca} 

U >* ) = T1 A' E Sp{"-v)u^ + [F{u,u L t)] 

provided the compatibility condition 

= [f(u,wt)]< fc) , (2.12) 

holds for all fc > 0. Equations (12.11|) formally provide a recursive definition of the coefficients , v ^ 0, 

in terms of the coefficients ' of lower orders k' < k. Indeed, the Diophantine condition (|1.5p ensures 
that no denominator can be zero, and the sum over the order and Fourier labels can be easily performed 
(as we shall check explicitly later on - see Section 0]). Thus, if the compatibility condition turns out to be 

(k) 

satisfied to all orders for a suitable choice of the coefficients u Q , one has an algorithm which allows to 
construct iteratively all the coefficients of the series (I2.ip . In that case, we say that the equations (|1.1|) 
are formally solvable. 




3 Examples 

In this section we review some examples of physically relevant dynamical systems which can be written in 
the form Therefore, for all such systems the problem of existence of formal quasi-periodic solutions 

is reduced to that of showing that (|2.11[) and (|2.12p can be recursively solved. The strategy that we shall 
follow in the next sections will be to show first that a formal power series solves those equations order 
by order, and, then, to study the convergence of the series for e small enough. We shall be able to prove 
that either the series converges or it can be suitably resummed so as to give a well-defined function which 
solves the equation (jl.ip . 

3.1 Quasi-integrable Hamiltonian systems: maximal tori 

Consider the Hamiltonian 

H(a,A) = ^A 2 +e/(a), A 2 = A ■ A = Aj + . . . + A\ , (3.1) 

where (a, A) E T" x M. n are angle-action coordinates, and /: T" — + M is a real analytic function. 
More generally we could consider Hamiltonians of the form 

H(a, A) = H (A) + ef(a, A), (3.2) 

where 7io and / are both real analytic functions in T n x A, A C M. n being an open set, with Ho convex 
and / periodic in the angles a. Existence and properties of quasi-periodic solutions for quasi-integrable 
systems, that is systems described by Hamiltonians of the form (|3.2|) . are the content of KAM theory 
[SH El [68]; see [3] for a review. 

The advantage of taking the Hamiltonian (|3.ip is that the corresponding Hamilton equations lead to 
a closed equation for a, 

a. = -ed a f(a), (3.3) 

which is the form (|1 . ip with u = a, m = 0, and D £ — d 2 (so that Sq(lli ■ v) = (\lo -v) 2 , while 6k(ui ■ v) = 
for all k > 1). 

We are interested in quasi-periodic solutions of the form (|1.4[) with d = p = n (and oj = u>) . For e = 
we take as unperturbed solution uo(u>t) = c*o + u>t, with ocq € T d arbitrary and uj = u> € R d satisfying 
the Diophantine condition (| 1 .5() . 
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If we write, in agreement with (|2.ip , 

oo 

a{ut,e) = a Q + Ljt + Y,z k E e^""*"^. ( 3 - 4 ) 



fc=i 



we find to all orders k > 1 - see (gTTT} and ftTT^i 



? ; (/;i2 [W(a)]r', ^^0, (3.5a) 

- [S«/(a)]( fc - x) , (3.5b) 

where 

[^/(«)]^ = E^ E w +1 /, E < l) ---«£ s) - (3-6) 

s=0 U|),i/ Ir ..,j/,6Z" fei,...,fe s >l 
«'0+«'l + ."+l'»=«' fei + ... + fe s =fc 

Thus, equation (|3.5a|) defines recursively the coefficients o.v for all k > 1 and all u ^ 0, provided 
equation (|3.5b|) is satisfied for all k > 1. 

The compatibility condition (|3.5b|) is automatically satisfied for k = 1, because [^/(a)]^ "* = it-*/^, 
which vanishes for 1/ = 0. It is a remarkable cancellation that the condition holds for all k > 1 - see 
Section |4~4"1 -. so implying that the perturbation series ()3.4p is well-defined to all orders. The coefficients 
a o i k — 1) can be arbitrarily fixed; for instance one can set aj, ; = for all k > 1 - see Section WAX 
We shall see in Section [7] that for s small enough the series converges to a function analytic in ip = u>t. 
As a consequence, there exists a quasi-periodic solution of the form (|3.4p , analytic both in e and i/>, and 
parameterised by cto £ T": hence such a solution describes an n-dimensional invariant torus (maximal 
KAM torus). 

In this paper we confine ourselves to the Hamiltonian (|3.ip . Note that in the case ()3.2p the unperturbed 
solution is uq = (ocq + u>(Ao)t, Aq), with U)(A) = Oa'Ho(A), hence it is still of the form Cq + ujt with 
n = 2p and LUi — for i > n + 1. However, strictly speaking the Hamilton equation are not of the form 
p.ip . so the analysis should be suitably adapted - see [4"T] . 

We could also consider, instead of (|3.3p . the more general equation 

a = -ed a f(a,u)t), (3.7) 
which reduces to ()3.3p for m — 0. This is still a Hamiltonian system, with Hamiltonian 

H(a,a,A,A) = -A ■ A + uj_ ■ A + ef(a, a). (3.8) 

The unperturbed solution to (|3.7p is of the same form as before, but in that case we look for a quasi- 
periodic solution with rotation vector u> — (u),uj). 

Then the angles a evolve trivially as a(t) = ujt, whereas equation ()3.5ap has to be replaced with 



-^E^[ E (-o) S+1 i^o E <K.. a ^\ (3.9) 



„(fc) = 

( w .„)a ^ 

s =° i/ ,^i,...,^ s ez ti fci,...,fe»>i 

v +ui_ + ... + v B = v k 1 + ... + k s = k— 1 



while the compatibility condition for ^ = reads 

=£i ^ (-or +1 /, E "^•••«^ ) - (3-io) 

' u ,vi_,...,v s et d fei,...,fc»>l 
fo+^i + --- + f s =0 fci + ... + fe s = fe-l 

However, the differences with respect to the previous case are just minor ones, as one can easily work out 
by himself. 
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3.2 Discrete systems: the standard map 



Beside continuous dynamical systems we can consider discrete dynamical systems (maps), such as the 
standard map \21\ I64j - also known as Chirikov- Greene- Taylor map. 

The standard map is defined by the symplectic map from the cylinder to itself 

fx' = x + y + e sinx, 
\y' = y + £smx, 

where (x, y) € T x K. 

For e = the motion is trivial: one has a simple rotation x' = x + 2t:u;, while y is fixed to y = 2itlo, 
uj € K. For e 7^ one can look for solutions which are conjugate to a trivial rotation of some other 
variable (KAM invariant curves), i.e. one can look for solutions of the form 

x = a + u(a,e), y — 2ttlu + v(a, s), (3-12) 

with a — > a' = a + 2ttlo and the functions u, v depending analytically on their arguments. The number 
u> will be called the rotation number. 

The functions u, v are not independent from each other. One has v{a, e) — u(a, e) — u{a — 2ttuj, e), as 
it is straightforward to check: simply note that x' = x + y' by (|3.11[) and express x' and y' in terms of a 
through (|3.12|) . using that a' = a + 2ttlo. So one obtains a closed equation for the conjugating function 
u, 

Du{a 1 e) = u{a + 2ttlu, e) + u(a — 2ttuj, e) — 2u(a, e) = e sin(a + u{a, e)). (3.13) 

If such a solution exists and is analytic in e, then it has to be possible to expand the function u as Taylor 
series in e and as Fourier series in a. So we are led to write, at least formally, 

00 

„ez k=i 

so implying, by calling [F(a, e)]v the coefficient of the function F(a, e) with Fourier label v and Taylor 
label k, according to (|2.3|) . 



5 (uju)ui k) = [sin(a + u(a, e))]^ 1} , 6 (wu) = 2 [cos(2tt^) - 1] . (3.15) 

Note that, for ojv small (mod. 1), 5q{ojv) ~ ||wi/|| 2 , if ||x|| = min pe z \x— p\ denotes the distance of x from 
the nearest integer. By explicitly writing sin a = ^2 V(1 =±i{2\)~ 1 VQe ll> ° a and Taylor expanding sin(a + u) 
in u around u = 0, one finds, from (|3.13[) . 

2do(<jjv) 



E Z ^^ l) -^ ) , *>1, (3.16b) 



5ci(u)u) ^-^ s\2 

uv ' S=0 l/ +Vl + ...+Vs=" 

k 1 + ...+k B =k- 



for v 7^ 0. It is easy to check that for all k > 1 one has u„ = if \v\ > k, and one can choose = 0. 
Then (|3.16|) can be iterate by taking into account that kj < k for any j = 1, . . . , s. 

When working in Fourier space, the recursive equations (|3.16p look very similar to the equations 
(|3.5a[) for continuous systems, with n = 2. Then, one could be tempted to write the standard map as the 
stroboscopic map of a continuous system. However, it turns out that, formally, one should consider the 
singular system (known in physics as the kicked rotator) with Hamiltonian 

A 2 

H(a, A) = 2ttAi + + 2vre 8(ai - 2-ku) (cos a 2 - 1) , (3.17) 



G 



where 5 is the delta function, and set x = ct2 and y = A2 — (e/2) sin a<i [53]. Therefore the corresponding 
Hamilton equations cannot be in the class (|1.1|) . where smoothness was required. Nonetheless, as we have 
seen, in Fourier space the analysis is essentially the same. 



3.3 Quasi-integrable Hamiltonian systems: lower dimensional tori 

Consider the Hamiltonian 

H(a, (3, A, B) = X -A 2 + \b 2 +e /(a, 0), (3.18) 

where (a, A) E T r x R r and (/3, B) 6T s xi s are angle-action coordinates, with r+s = n, and / : T" — > R 
is a real analytic function. 

We can also consider the same Hamiltonian as p.l[) . but assume that a; is a resonant vector, that is 
that there exist s integer vectors V\, . . . , v s such that ujui = . . . = u>-v s = 0. If this happens, it is possible 
to perform a linear change of coordinates such that in the new coordinates oj = (u>i, . . . , Lo r , 0, . . . , 0), with 
(oji, . . . , u) r ) satisfying a Diophantine condition in W . Of course the corresponding Hamiltonian would 
be slightly more complicated than ()3.18p . For simplicity's sake we shall confine ourselves to (|3.18|) . and 
study the problem of existence of a quasi-periodic solution with rotation vector ui = . . . , w r ) which 
for e = reduces to u = (a a + u>t, /3 , A , 0), with A = lj. 

In terms of the angles (et,0) the Hamilton equations become 

(a = -ed a f{u,0), 
\j3 = -edpf( a ,f3), 

so that the unperturbed solution is (cto + u>t,/3 ), which is of the form uq = cq + fit, with CI — 
(lui, . . . , ui r , 0, . . . , 0) = (a;, 0). Hence we look for solutions of the form (|1.4p with d = p = r. 

Since (3 is expected to remain close to f3 , we Fourier expand / only in the angle a, so writing 

/(«,/?)= eiV "*MP)- (3-20) 

Therefore, (|3.19p gives, in Fourier space, 

((vvfccP = [d a f(cx ) (3)}l k - 1) 



(u-uY^ = [dpf(a,l3)] 



(fc-i) 



(3.21) 



with 



00 1 

[5 Q /(a,/3)]S fe) = £ ^ E (3-22) 

p,q=0 vo,vi fp+ s €Z" k\,...,k p+q >l 

i>a+v\ + ...+v v +q=v k 1 + ... + k p+ll =k 

and an analogous expression holding for [dpf{a, (3)] { ^ - with (iu ) p d q p +1 instead of (ii^ ) P+1 d q p . 
Again for i/ = 0we require both 

[0«/(a, (3)} ( o k) = 0, [d p f(a, (3)} { Q k) = 0, (3.23) 



for all k > 0. We shall see that the first compatibility condition is automatically satisfied for all values of 
olq and ot^o \ k > 1, while the second one requires (3 and (3^\ k > 1, to be suitably fixed. This is clear 
already to first order, where we obtain 

Uu-vfa.^ =ivf v (P ), 
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so that for v = the first equation trivially holds, whereas the second one fixes /3 to be such that 
dpfoifto) = 0, i.e. /3 must be a stationary point for the function /o(/3 ) (such a point always exists). 
Moreover to higher orders one has 

[c^/(a,/3)] fc) = a|/ (/3 )/3|, fe) + (3.25) 

for a suitable function f&k depending only on the coefficients of order strictly less than k - see Section 
14.41 for details. Thus, if we further assume that the matrix <9|/o(/3 ) be nonsingular (nondegeneracy 

condition), then we can impose the compatibility conditions [dpf(a, (3)]^ = by suitably fixing the 

(k) 

corrections f3 , k > 1, to the constant part of the j3 angles. 

In Section [1] we shall see that, at least formally, a quasi-periodic solution parameterised by a € T r 
exists for suitable values of (3 . We shall see in Section [5] that, even if the formal series is divergent, 
however it can be suitably resummed for e small enough so as to be given a meaning as a well-defined 
function analytic in xp = ujt: hence the latter describes a lower- dimensional torus. If the matrix dpfa(f3 Q ) 
is positive definite we shall say that the lower-dimensional torus is hyperbolic if e < and elliptic if e > 0; 
in the latter case we shall see that the torus exists only for some values of e, more precisely for e defined 
in a Cantor set with Lebesgue density point at the origin (Cantor is ation) - see Section [TU1 



3.4 Strongly dissipative quasi-periodically forced systems 

Consider a one-dimensional system subject to a mechanical force g, in the presence of dissipation and of 
a quasi-periodically forcing. The equation describing the system is the ordinary differential equation 

x + fx + g{x) =/(wt), (3.26) 

where x G R, 7 > is the dissipation coefficient and u € R m is the frequency vector of the forcing. We 
assume that both g: A — » R and /: T m — > R are real analytic functions, with A C R an open set. 

If the dissipation is large enough, it is natural to rewrite (|3.26|) in terms of the small parameter 
e = I/7, so as to obtain the equation 

x + ex + eg(x) = ef(wt), (3.27) 

which is of the form with n = 1, u = x, D s = &t +£<9 t 2 , and F(u,urt) — —g(u) + f(ujt). In particular 
one has 5o(uj ■ v) = iu) ■ v, Si(u> ■ v) — (iw • v) 2 , ai — 1, and 5k(u> ■ v) = aj~ = for all k > 2. 

We look for quasi-periodic solutions x(ujt 1 e) which are analytic in ip = uxt and continue the unper- 
turbed solutions xq — Co, with Co constant. Such solutions (if any) are called response solutions, as they 
have the same frequency vector as the forcing. Thus, w — lo and d — m, so that, if we write 



x(wt, e)=c + Y,£ k ei -- 4 fc) - ( 3 - 28 ) 
k=i i/eZ" 



then we obtain 



iw • vx^ + Qu ■ vf x^-V + [g(x)f-V = fJ kA (3.29) 

id 4 0) 

The first order equation gives 

(3.30) 



for k > 1, while x^ = cq and x^ = for v_ ^ 
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which fixes the value of cq (of course /o must belong to the range of g). The unperturbed solution can be 
seen as a quasi-periodic solution in the extended phase space (x, ^) £ Rx T m , where it looks like (co, wt), 
so that the full solution is of the form (cq + X(ujt, Co, e), ujt). 

If d x g{co) ^ 0, then to higher orders one has the compatibility conditions 

=9 x g{co) x p + G fe (c ,x«,...,x< fc - 1 >)=0, k>l, (3.31) 

for a suitable function Gk depending only on the coefficients of orders k' < k. An explicit calculation 
gives 

oo 

G k (co,x^,...,x^)=J2^d s x g(co) £ £ x™...x£'l (3.32) 

s = 2 ' iii,---,ii s £Z m fel,...,fe s >l 

iii+---+k s =o fei+...+fe s =fe 

Thus, (|3.31[) can be used to fix the corrections Xq°\ k > 1, to the constant part of the solution x(tJt,s). 

We shall see in Section that, under the nondegeneracy condition d x g(co) ^ 0, the series (|3.28j) can 
be resummed for e small enough into a function which depends analytically on ip = cot. 

4 Diagrammatic representation and tree formalism 

We have to study the recursive equations (|2.11|) and (|2.12j> . with [Ffaut)]^ 1 ' given by ()2.10j) . This 
will be done through a diagrammatic formalism, known as the tree formalism. 

Let us assume that one can decompose u = (u,u), with u G R" and u € R™, n + n = ri, and, 
accordingly, F = (F, F), so that for v — one has 

[F(u,ut)i k) =0, (4.1a) 
[F{ U)i jt)p =Au { V +$ k , (4.1b) 

for a suitable nonsingular matrix A and a suitable vector <£>£ (0 is the null vector in R n ). We shall see 
that in all cases considered in Section [3] this holds true - see Section 14.41 below. 

We first introduce the trees (see also [11] [56l E2] ) as the main combinatorial and graphical objects that 
we shall use in the forthcoming analysis. Then we shall provide some rules how to associate numerical 
values to the trees, so as to represent the coefficients u„ in terms of trees. 

4.1 Trees 

A connected graph Q is a collection of points (nodes) and lines connecting all of them. Denote by N(Q) 
and L(Q) the set of nodes and the set of lines, respectively. A path between two nodes is the minimal 
subset of L(Q) connecting the two nodes. A graph is planar if it can be drawn in a plane without graph 
lines crossing. 

A tree is a planar graph Q containing no closed loops. Consider a tree Q with a single special node 
vq: this introduces a natural partial ordering on the set of lines and nodes, and one can imagine that 
each line carries an arrow pointing toward the node vq. We add an extra oriented line £o exiting the 
special node vq; the added line will be called the root line and the point it enters (which is not a node) 
will be called the root of the tree. In this way we obtain a rooted tree 9 defined by N(6) — N(Q) and 
L(9) = L(Q) U £q. A labelled tree is a rooted tree 9 together with a label function defined on the sets L(9) 
and A (6>). 

We call equivalent two rooted trees which can be transformed into each other by continuously de- 
forming the lines in the plane in such a way that the lines do not cross each other. We can extend the 
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notion of equivalence also to labelled trees, by considering equivalent two labelled trees if they can be 
transformed into each other in such a way that the labels also match. In the following we shall deal 
mostly with nonequivalent labelled trees: for simplicity, where no confusion can arise, we call them just 
trees. 

Given two nodes v,w € N(0), we say that w -< V if v is on the path connecting w to the root line. 
We can identify a line i through the node v it exits by writing I — £ v . 

We call internal nodes the nodes such that there is at least one line entering them, and end nodes the 
nodes which have no entering line. We denote by V(6) and E(8) the set of internal nodes and end nodes, 
respectively. Of course N(0) = V(9) U E(6). 

The number of unlabelled trees (i.e. of rooted trees with no labels) with N nodes - and hence with 
N lines - is bounded by 2 2N , which is a bound on the number of random walks with 2N steps [38]. An 
example of unlabelled tree is represented in Figure [T] 




Figure 1: An example of unlabelled tree. 



For each node v denote by S(v) the set of the lines entering v and set s v = \S(v)\; here and henceforth, 
given a set A, with denote by \A\ its cardinality. Hence s v = if v is an end node, and s v > 1 if v is an 
internal node. One has 

£ 8v = Sv = \N(6)\-l; (4.2) 

v£N(8) veV(6) 

this can be easily checked by induction on the number of nodes of the tree. 
4.2 Labels 

We associate with each node v €E N(0) a mode label v v £ Z d , and with each line I € L(0) a momentum 
label vi € Z d , with the constraints that v v ^ if v € E(0) and 

vi v = ^2 = + ^ v e , (4.3) 

which represents a conservation rule for each node. 

We also associate with each node v € N(0) an order label k v £ {0, 1, ... , fc }, such that k v = if 
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vg v = and k v > 1 if V£ v ^ 0. We set 

fc(0)= ^ k v , v{9) = ]T ( 4 - 4 ) 

v£N(8) veN(0) 

which are called the order and the momentum of 9, respectively; note that v(9) is the momentum of the 
root line of 9. 

Finally we associate with each node v G V(9) a badge label p v S {0, 1}, such that k v £ {0, 1} if p,; = 1, 
while i/^ 7^ 0, i/^ = 0, and s v = 1 if p v = 0. 

Call 7/j ^ the set of all trees # with order fc and momentum v, with the constraint that if a line I € 
has ^ = and exits a node u with i/„ = then > 2. It is easy to check that there exists a positive 
constant n such that k{9) < k\N(6)\; simply use that s v ^ 1 when k v — 0. 



4.3 Diagrammatic rules 

We want to show that trees naturally arise when studying the equations (|2.11|) . Let be represented 
with the graph element in Figure [5] as a line with label v exiting from a ball with label (fe) . 



(k) 

Uv 



(*) 



Figure 2: Graph element. 



Then we can represent (|2.11j) graphically as depicted in FigureEl Simply expand [F(u,ut)]l as in 
(|2.10|) and represent each factor on the right hand side as a graph element according to Figure [2] 
The lines of all such graph elements enter the same node vq. This is a graphical expedient to recall the 
conservation rule: the momentum v of the root line is the sum of the mode label vq of the node vq plus 
the sum of the momenta of the lines entering vq. Note that k Vo > 1 as v ^ in (|2.1ip . 




Figure 3: Graphical representation of the recursive equations. 

We represent also (|2.12j) as in Figure [31 with the only difference that now k V(1 = and hence s Vo > 2 
(recall the definition of Tk,u at the end of Section |4~2|) . 

Given any tree 9 G Tk,v we associate with each node v G N(6) a node factor $ v and with each line 
I G L(9) a propagator <3i, by setting 

ft,: PV = 1: ^^K 1 ^-^ 1 ' V '*°> (4.5) 

\-d kv (u}-v iv )t, p v = 0, I G, ^ = 0, 
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where 1 is the n x n identity, and G is the n x n matrix of the form 



where the null matrices are n x n, n x n, and n x n, respectively, while A is the invertible matrix 
appearing in (|4.1b[) . Define the value of the tree 9 as 

Val(0):=( J] ^)( II ( 4J ) 

v£N(8) ££L{9) 

The propagators (8^ are matrices, whereas each $ v is a tensor with s v + 1 indices, which can be associated 
with the s„ + 1 lines entering or exiting v. In (|4.7| the indices of the tensors $ v must be contracted: this 
means that if a node v is connected to a node v' by a line £ then the indices of $ v and 3^/ associated 
with £ are equal to each other, and eventually one has to sum over all the indices except that associated 
with the root line. 

The node factors in (|4.5|) are bounded as maxj^.,.^ +I KfftOii—js +il — SoS^e - ^""' if = 1, while 
one has s„ = 1 and \b^ x {uj ■ V£ v )\ maxj 1 j 2 \(3v)j 1 ,j 2 \ — l w ' v l v \^ kv ~ l °^ T if p v — 0. As to the propagators 
one has ||0^|| < 7o~ 1 |^| T ° f° r v i ^ an d ||©f II < ll^ _1 || f° r v t — 0j where || • || denotes - say - the uniform 
norm. 

By iterating the graphical representation in Figure [3] until only graph elements with k = 1 appear, 
one finds 

4 fe) = Val (#), k > L ( 4 - 8 ) 

The iree expansion ()4.8[) makes sense since all node factors and propagators are finite quantities, and 
the sum over the labels can be performed. Except the mode labels, the last assertion is trivial for all 
other labels (as they can assume only a finite number of values). Finally, the sum over the mode labels 
is controlled by the exponential decay e - ^"' of the Fourier coefficients T Vv of the node factors. 

The study of the convergence of the perturbation series is made difficult by the product of propagators 
in (|4.7j) . Indeed, the denominators Sq(uj ■ v) can be arbitrarily close to zero for v large enough. This 
problem is usually referred to as the small divisor problem. 



4.4 Compatibility conditions 

Now, we show that (|4.ip holds for all the models considered in Section [31 Note that to prove (|4.ip is 
not a purely technical problem: for the very models of Section [3J quasi-periodic solutions in the form of 
formal power series or even quasi-periodic solution tout court can fail to exist, if one weaken too much 
the assumptions - see also Section 112.21 

Let us start from the model in Section 13.11 recall that d = p = n in such a case. We have already 
checked that (|3.10[) trivially holds for k = 1. Then we can prove by induction that for all k > 2 the 
compatibility condition (|3.10[) holds and one can set — 0. The proof proceeds as follows. By using 
(|4.8[) we express [F(u,uA)]^ according to (|2.6p as sum of trees in which all lines except the root line 
have nonzero momenta, by the inductive hypothesis. Given a tree 9 consider together all trees which can 
be obtained from 9 by detaching the root line and attaching it to any other node; see Figure |U 

In that way, we obtain as many trees as nodes of 9; call T{9) the set of all such trees. Of course, 
all arrows must point toward the root, so that the trees 9' G ^F{9) have all the same mode labels (by 
construction), but they can have different momenta. On the other hand, since v = 0, a line £ <E L(9') 
either has the same momentum U£ as in 9 (if the arrow has not been reverted) or has momentum —vg (if 
the arrow has been reverted). Since 5o(vg) = Sq(— vg), this means that the corresponding propagator 
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Figure 4: An example of tree 9' obtained from 8 by detaching the root line from the node vo and reattaching it 
to the node v\. The arrow of the line connecting the nodes wo and v\ is reverted so as to point toward the new 
location of the root. One can always stretch the lines so as to make all arrows go right to left, as in the last graph. 



does not change. The combinatorial factors of the trees 6' £ are in general different from those of 

9 (because the values of s v , v € N(6'), can change), but if we sum together all nonequivalent trees we 
realise (with a little effort: one must perform the computation to convince himself that the assertion is 
true!) that we obtain a common value times a factor iv Vo , if vq is the node which the root line is attached 
to. Therefore, since 



the sum of all the tree values gives zero. This implies (|4.1|) with n — n and n — 0, and the coefficient 

(k) 

a is left arbitrary, and it can be chosen to be zero. 

In fact, the argument above does not depend on the value of the coefficients af, which therefore 
can be arbitrarily chosen; in particular they can and will arbitrarily fixed to be zero. This was ex- 
pected: changing a.^ means changing the constant Qo m Q3.4[) . which is arbitrary since it is the vector 
parameterising the torus. 

The case of the standard map - see Section 13.21 - can be discussed in the same way. We omit the 
details. 

In the case of the model in Section f3T3l the identity [d a f(a, (3)]^ = for k > 2 can be proved as 
above, by relying on the same cancellation mechanism. The compatibility condition [dpf(a, (3)]^ = 
for k > 2 can be imposed by using (|3.25[) and fixing (3^ = — (9^/o(/3 )) _1 $ / r.. This implies once more 
(|4.ip with n = r and n — s. Again the coefficients can be arbitrarily set to be zero. 

Finally for the model in Section [3~4l one can use (|3.31[) to obtain (|4.ip with n = and n = n = 1. 

5 Multiscale analysis 

To be able to bound the tree value (|4.7p . we need to control the product of propagators. This will be 
done through a multiscale analysis. To this aim, for each tree line we introduce a new label characterising 
the size of the corresponding propagator, that we call the scale label. 
Essentially, we say that v £ Z d \ {0} is on scale 



where 7 is the constant appearing in (|1.5p . and we say that a line i has a scale label rig — n if vg is on 
scale n. 

As a matter of fact, in practice the sharp multiscale decomposition in (|5.1[) is a little annoying because, 
as we shall see, we have to consider derivatives. Thus, it is actually more convenient to replace it with a 
smooth decomposition through C°° compact support functions. Let ip be a nondecreasing C°° function 




(4.9) 



v GN(9) 



n>l, if 2-" 7 < \uo ■ v\ < 2"( n " 1 ) 
n = 0, if 7 < \u> ■ v\, 



(5.1) 
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defined in R + , such that 

«■>-{!: £:!?a <"» 

and set x( x ) '■— 1 — ^(x)- For all n G Z + = NU {0} define Xn(^) : = x(2"^) and tp n (x) := ip(2 n x), and 
set 

2„(x) = Xo(M) ■ ■ ■ Xn-i(M)Xn(M), = Xo(|x|) . . . Xn-i(\x\)ipn(\x\), (5.3) 

where ^oO^) is meant as ^o{x) = V'o(M)- 

Then we change the definition of the propagator to be associated with each £ with vi ^ 0, by 
associating with each such line £ a scale label rig € Z + and a propagator 

G5 £ = G [llf V G [n] {uj-v) :=^ n ^-v)8^ l {u:- V )\, (5.4) 

which replaces the previous definition in (|4.5|) . If $f n (x) ^ then 2 - ™~ 1 7 < \x\ < 2~™ +1 7, so that for any 
each £ € £(#) one has either <&g = or ||©^|| < -y^^™*^ 1 ) 10 . For completeness we also associate a scale 
label ri£ = — 1 with each line £ with momentum vg_ — 0. Note that, while with the sharp decomposition 
(|5.ip a momentum v identifies uniquely the scale n, on the contrary by using the smooth decomposition 
for each momentum v there are two possible (adjacent) values n such that (uj ■ v) =/= 0. 

A tree expansion like (|4.8j) still holds, with the difference that now the trees 9 € Tk tV carry also the 
scale labels, and we have to sum also on these labels. The equality between the two expansions follows 
immediately from the observation that X)^Lo ^n( x ) = 1 for all x G M \ {0}. 

If €R T1 (6 1 ) denotes the number of lines I € L(6) with scale ni = n, then we can bound in (|4.7[) 

oo 

n \\^>A\<% k2kio x{ 2riiomn{9 \ (5.5) 

£6L(0) n=0 

with io defined in (|1.2|) . so that the problem is reduced to bounding D r l„(6'). 

The product of propagators gives problems when the small divisors "accumulate" . To make more 
precise the idea of accumulation we introduce the notion of cluster. Once all lines of a tree 9 have been 
given their scale labels, for any n > we can identify the maximal connected sets of lines with scale not 
larger than n. If at least one among such lines has scale equal to n we say that the set is a cluster on 
scale n. Given a cluster T call L(T) the set of lines of 9 contained in T, and denote by N(T) the set of 
nodes connected by such lines. We define k(T) = X^ga^t) ^« the or der of the cluster T. 

Any cluster has either one or no exiting line, and can have an arbitrary number of entering lines. We 
call self-energy clusters the clusters which have one exiting line and only one entering line and are such 
that both lines have the same momentum; the terminology is borrowed from quantum field theory. This 
means that if T is a self-energy cluster and £\ and £2 are the lines entering and exiting T, respectively, 
then vs x = vi 21 so that 

Vv = 0. (5.6) 

veN(T) 

By construction the scales of the lines £\ and £2 can differ at most by 1, and setting ut = mining, n£ 2 }, 
by definition of cluster one has ni < nx for all £ £ L(T). 

We define the value of the self-energy cluster T whose entering line has momentum v as the matrix 

Vr(w •!/):=( 1] &)( II ( 5 - 7 ) 

»ew(T) eeL(T) 

where all the indices of the node factors must be contracted except those associated with the line £\ 
entering T and with the line £2 exiting T. 



14 



We can extend the notion of self-energy cluster also to a single node, by saying that v is a self-energy 
cluster if s v — 1 and the line entering v has the same momentum as the exiting line. In that case (I5.7[) 
has to be interpreted as Vt(^ • v) = 3v- m particular it is independent of u> ■ v if p v = 1. If T" consists of 
only one node (and hence contains no line) we say that T is a cluster on scale —1. 

The simplest self-energy cluster one can think of consists of only one node v, but then (|5.6|) implies 
v v = 0. For the model in Section [3.11 the corresponding value is zero, and hence the simplest nontrivial 
self-energy clusters contain at least two nodes. On the contrary for the models in Sections 13.31 and 13.41 
one can also have clusters with only one node. 

The reason why it is important to introduce the self-energy clusters is that if we could neglect them 
then the product of small divisors would be controlled. Indeed, let us denote by 9t n (0) the number of 
lines on scale n which do exit a self-energy cluster, and set 9t* (9) = s Xt n (9) — 9\ n (9). Then an important 
result, known as the Siegel-Bryuno lemma, is that 

m* n (9)<c2- n ^K(9), K{9):= £ M, (5.8) 

veN(e) 

for some constant c, where r is the Diophantine exponent in ljl.5|) . See Appendix [X] for a proof. 

If no self-energy clusters could occur (so that 9\ n (9) — 0) the Siegel-Bryuno lemma would allow us to 
bound in (|5.5p 

oo oo oo 

jj 2 m o <n„(0) = -Q 2 nio9£(9) < 2 n °*° k jj 2 mo<n '^ < Cf exp U{n^K (0)) , (5.9) 

n— n— n— no + 1 

with Ci = 2 n " la and 

OO 

Z(n ):=i c n2 ~ U/T - ( 5J °) 

n— no + 1 

Since ^(^o) -» as no -> oo one can fix no in such a way that £(/io) < (the constant £ being defined 
after (|2.6p ). Then, by extracting a factor e~^ Uv ^ 2 from each node factor $ v , one could easily perform the 
sum over the Fourier labels, so as to obtain an overall bound e - ^'/ 2 on for a suitable constant 
Ci. This would imply the convergence of the perturbation series (I2.1[) for £ small enough, say for \e\ < Sq 
for some Eq > 0. However, there are self-energy clusters and they produce factorials, as the example in 
Appendix fAl shows, so that we have to deal with them. 



6 Resummation of the series 

Let us come back to the equation (|2.11[) . To simplify the analysis, let us initially assume that we are 
using the sharp multiscale decomposition (|5.1[) , so that each momentum fixes uniquely the corresponding 
scale. If we take the tree expansion of the right hand side of (|2.11j) . according to the diagrammatic rules 
described in Section rjj we can distinguish between contributions in which the root line exits a self-energy 
cluster T, that we can write as 

£ V T {u- V )u^' k(T) \ (6.1) 

T:k(T)<k 

and all the other contributions, that we denote by [F(u, wt)]^ c_1 ' ) *. In (|6.ip both the entering and exiting 
lines of T have the same scale nr , and the sum is over all clusters T on scale < tit ■ 
By writing (|2.1ip as 

min{£:,fco } 

6o(lo-v)u^ = - ^-v)u^- p) + [F{uM)t~ 1) (6.2) 

P =i 
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we can shift the contributions (|6.1[) to the left hand side of (|6.2p . so as to obtain 

5o(u-i>)uW- Yl ^(w^^'^^afjf 11 *. (6.3) 

T:k{T)<k 

By summing over k and setting 

oo 

M(w • v,e) := ^e k ^ V t {uj-v), (6.4) 

fe=l T:k(T)=k 

then (|6.3p gives, formally, 

I>(w-i/,E)u v = [F(w,a;t)]*. D(w • u, e) := S (u ■ v) 1 - M(u ■ v,e). (6.5) 

The motivation for proceeding in this way is that, at the price of changing 5o(w ■ v) into T>(u> -v,e), hence 
of changing the propagators, lines exiting self-energy clusters no longer appear. Therefore, in the tree 
expansion of the right hand side of the equation, we have eliminated the self-energy clusters, that is the 
source of the problem of accumulation of small divisors. 

Unfortunately the procedure described above has a problem: M(w ■ v, e) itself is a sum of self-energy 
clusters, which can still contain some other self-energy clusters on lower scales. So finding a good bound 
for M{u) ■ v, e) could have the same problems as for the values of the trees. 

To deal with such a difficulty we modify the prescription by proceeding recursively, in the following 
sense. Let us start from the momenta v which are on scale n = 0. Since the only self-energy clusters T 
with riT = are those (on scale —1) containing only one node, for such v the matrix M(u> ■ v,s) is just 
a constant. Next, we pass to the momenta v which are on scale n = 1, and we consider (|6.5[) for such 
v\ now all self-energy clusters T whose values contribute to M(u> ■ v,e) cannot contain any self-energy 
clusters, because the lines £ € L(T) are on scale ne = 0. Then, we consider the momenta v which are on 
scale n = 2: again all the self-energy clusters contributing to M(u> ■ u, e) do not contain any self-energy 
clusters, because the lines on scale n = 0, 1 cannot exit self-energy clusters by the construction of the 
previous step, and so on. The conclusion is that we have obtained a different expansion for u(u>t, e), that 
we call a resummed series, 

oo 

where the self-energy clusters do not appear any more in the tree expansion and the propagators must 
be defined recursively, as follows. The propagator 25^ of a line I on scale ng = n and momentum = v 
is the matrix 

G/:=GM(w -i/,e)= (s (uj-v)t-M ln - 1] (u}-v,sf) , (6.7) 

with 

M [n \uo-v,e) := ^ e k( - T) V T (^ • v), (6.8) 

Ton scale <n 

where the value Vt(^ • v) is written in accord with (|5.7|) . with all the lines £' £ L(T) on scales < n 
and the corresponding propagators 0^ expressed in terms of matrices M\ n '-'\uj ■ vi>,s) as in (|6.7j) . 

By construction, the new propagators depend on e, so that the coefficients ul fc '(e) depend explicitly 
on e: hence (|6.6p is not a power series expansion. 
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If we use the smooth multiscale decomposition, then the algorithm above must be suitably modified. 
We define recursively the propagators &£ = G^ nt \uj ■ i>i, e) by setting for n > 

G [n] (uj ■ v,e) = 9 n (u> ■ v) (s (uj ■ v)t - M [n ~ 1] {uj ■ v,e)\ * (6.9a) 

M [n] (x,e) =M [n - 1] (x,e) + E n (x) M^(x,e), MW(i )£ ) = ^ e k{T) V T {x), (6.9b) 

Teiz„ 

where TZ n is the set of self-energy clusters on scale n which do not contain any other self-energy clusters. 

The matrices M\ n ^{u ■ v,e) are called the self-energies. The new propagators (|6.9a[) are called, by 
exploiting once more the analogy with quantum field theory, the dressed propagators. 

The coefficients iti (s) still admit a tree expansion 

4%)= E Val ^' Val(0):=( J! II «*) "^0, fc >!. (6-10) 

8eT^ v veN(e) leL(d) 

which replaces (|4.8j) . In particular T^\, is defined as the set of renormalised trees of order k and momentum 
u, where "renormalised" means that the trees do not contain any self-energy clusters. 

Since for any tree 9 G T^ v one has 9t ra (#) = 9t* (9), we can bound the product of propagators according 
to (15. 5|) and (|5.9|) , provided the propagators on scale n can still be bounded proportionally to 2 ra ° . In 
general there is no reason why this should occur, because of the extra term .M'™ -1 ' {lo ■ v, e) appearing in 
(1633 . 

The discussion of such an issue depends on the particular model one is studying. We shall see in 
the next sections what happens for the models considered in Section [3] We shall first consider cases in 
which the dressed propagators can be essentially bounded as the old ones, and then cases in which this 
is no longer true. By modifying further the resummation procedure described above, we shall see that 
something can still be achieved also in these cases. 



7 Cancellations and convergence of the series — maximal tori 

Let us consider the matrix A4^(x, e) introduced in (|6.9b[) . and let us study its dependence on the first 
argument x — lj ■ v for the models of Section [31 As usual, let us consider first the model in Section 13.11 
It is a remarkable cancellation that M\ n '{x, e) vanishes in x up to second order, that is M^ n \x, e) = 
0(x 2 ). The symmetry properties 

M [n] (x,e) = (M [n] (-x,e)) T - (M [n] (x, e))t, (7.1) 

with T and f denoting transposition and adjointness, are essential for the proof (such properties are 
trivially satisfied for n — 0, and can be proved by induction on n - see |47[ 134) for more details). 
Indeed, by using l|7.1[) . the cancellation M^ n \0,e) = can be proved as the cancellation [F(u,ut)]^ — 
[d a f(a)]^ = discussed in Section l4~4l with the exiting line of the self-energy clusters playing the role 
of the root line. The first order cancellation requires d x M.^(0,e) = 0, and this can be proved through 
a similar cancellation mechanism: besides the exiting line one has to detach also the entering line and 
reattach it to all the other nodes inside the self-energy clusters. If the function / in (|3.ip is even in a, 
then the first order cancellation follows also from parity properties [32] . 

Both the cancellations and the symmetry properties are only formal as far as we have not proved that 
the self-energies are well-defined quantities. To this aim we need to control the product of propagators 
in (|5.7[) , with the propagators defined according to (|6.9p . An important ingredient of the analysis is that 
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also for all self-energy clusters one can prove a bound like (|5.8p . More precisely, if we denote by O^^T) 
the number of lines I G L(T) on scale n, for all n > and all T £ 7?. n one has 

K'(T)<c'2-"'/r# (T); A"(T):= ^ |i/„|, n'<n, (7.2) 

for some constant c'. To prove (|7.2p one first show that, for all n > and all T € 7?.„, one has 

J2 H>c"2 n ^, (7.3) 

for some constant c", then one proceeds by induction on the order k(T); see Appendix [Bl for details. 

Therefore, if we were able to prove for the dressed propagators an estimate like ||G^(x,e)|| < 2/x 2 , 
then we could use (17.211 and (17.311 to bound 



(7.4) 



n n^ii<7o- fe 2 3fe n 22 "' K ' (T) ' 

£eL(T) n'=0 

in such a way to obtain 

\V T (wv)\ < C 2 fe(T) e-«^ T )/ 2 , (7.5) 

for a suitable constant Ca, independent of T. In particular, this would ensure the well-definedness of the 
self-energies. At this point, the cancellations would allow us to write, for some constant C and for all 
n > 0, 



M [n] (x,e) = s 2 x 2 M l >,£), 



M (x,e) 



<C, 



(7.6) 



where we have taken into account also that Vt(%) requires k(T) > 2 (cf. Section H}. In turn (|7.6I 
would implies that, for e small enough, 



r x 2 -e 2 x 2 M [n - 1] (x,e) 



< 



(7.7) 



The bounds (|7.7|) on the propagators, the symmetry properties, and the cancellations are proved all 
together, as follows. First note that, if the second order cancellation holds, one can write 



e 2 M m (x,e) 



dt (l-t)d 2 M [n] (tx,e), 



(7.8) 



so that the bound in (|7.6p is essentially a bound on the second derivative of the self-energies. The case 
n = is easily checked. Then the proof proceeds by induction, by relying on the recursive definition 
of M\ n '{x,e) - see (|6.9I) and (|5.7[) -, and taking advantage of the smooth multiscale decomposition to 
perform the derivatives. More precisely, we assume that both the cancellations - and hence the bounds 
(|7.6p - and the symmetry properties (|7.1[) hold for all n' < n. This means that all the dressed propagators 
of the lines on scales < n are bounded according to (17. 7|) . so that we can use the bound (|7.5p to prove 
that also M\ n \x,e) is well-defined. Then the cancellation mechanism described at the beginning of the 
section shows that also at the step n the symmetry properties (|7.ip and the cancellations leading to (17. 6p 
are satisfied; in particular also the propagators G^ n+1 \x,e) are bounded proportionally to \x\ 2 according 
to (1771) . 

The conclusion is that the series in (|6.6p for u v = c*„ converges for e small enough. Therefore, the 
function u(wt,e) = a(u)t,e) is analytic in e (notwithstanding that the expansion in e is not a power 
expansion), so that we can say a posteriori that the original power series (|3.14p also converges. It is 
straightforward to see that av decays exponentially in f, which implies that the function a(xp,e) is also 
analytic in ip. 

The case of the standard map - see Section l3~2l - can be discussed in the same way. We do not repeat 
the analysis and refer to [37l[T0] for details. 
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8 Summation of the divergent series — hyperbolic tori 



Now we consider the model introduced in Section [3.31 In that case, one has 



MW(*,*)=§ e3|/!(/3 ))+°( £2 )' ^ 

so that, already only keeping the first order terms, one realises that a cancellation like ()7.6|) cannot 
expected to hold. Indeed, in order to study the convergence of the series (|6.6|) . we need at least the 
perturbation series (|2.1|) to be formally well-defined to all orders; in turn this requires matrix dpfo(f3 Q ) 
to be nonsingular - see Sections 13.41 and 14.41 - and hence different from 0. 

Let us assume first that the matrix dpfo(/3 ) is positive definite, that is that its eigenvalues Oi, . . . , a s 
are positive, i.e a% > for i = 1, . . . ,s (in particular this means that /3 n is a maximum point for the 
function f (/3)). 

We write both the self-energies and the propagators as block matrices, 

where the four blocks are r x r, r x s, s x r, and s x s matrices, respectively. 

Then, one can prove that the parity properties (|7.1[) still hold, and moreover, formally, one has the 
cancellations 

M%l(x,s) = 0(e 2 x 2 ), M™(x,e)=M [ Z ] a (x,s) = 0(s 2 x). (8.3) 

The proof of such assertions can be performed by induction, and follows the same pattern as described in 
Section 0- see [351131] for details. We have used the word "formally" because the cancellations hold as 
far as the dressed propagators (|6.9a|) can be bounded essentially as the old ones (|5.4p - a property that 
we have not yet proved. 

The main implication of (|7.ip and (|8.3p is that the eigenvalues A| n '(a;,e) of the self-energies M^- n \x,e) 
are of the form 

4"W) ={ 0(s ° x 7o<^ i=1, :," r ' ,, < 8 ' 4 » 

\ai- r e + (J(e ), i = r + 1, . . . , a. 

In particular, if e < 0, the eigenvalues x 2 — a[™' (x, e) of the matrices 5q(uj ■ v) 1 — (x, s) are such that 

x 2 — A,["'(x, e) > x 2 /2 for i = 1, . . . , r and x 2 — A[ n '(a;, e) > x 2 + \ea,i~ r \/2 for i = r+ 1, . . . , d, provided e is 
small enough and the higher order corrections in (|8.4[) remain small. The last property is automatically 
satisfied if the block matrices in (|8.3p are dominated by the first nontrivial orders. 

All the properties described above become rigorous if the dressed propagators G^(x,e) are bounded 
proportionally to x~ 2 (more generally any power of |x| would suit), say 

GM(z, £ )|<^. (8.5) 

So, all we have to do is to prove together all the above properties (|7.ip and (|8.3p . by induction. Indeed, 
for n = the properties are trivially satisfied, and at any step n, by the inductive hypothesis, the bounds 
||G[ n 'l(a;,e)|| < 2/x 2 are satisfied for all n' < n, so that both (|7.ip and (|8.3|) hold for n, and in turn this 
implies (|8.4p and hence the bound (|8.5p for n + 1. 

Therefore, the series (I6.6P converges for e small enough, even if analyticity is prevented because of 
the condition e < 0; then we say that the perturbation series (|2.ip is a divergent series (in point of fact, 
it is very likely that it does not converge, though there is no proof of that). The function is only C°° in 
e at e = - such a result improves a previous one by Treshchev [79], where C°° -smoothness in y/e was 
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proved at e = 0. In fact, one can say a little more about the dependence of the invariant torus on the 
perturbation parameter e in the complex e-plane: the lower-dimensional torus turns out to be analytic 
in the heart-shaped domain of Figure 35] . 

Ime 

\ HO f 



Figure 5: Analyticity domain for the hyperbolic invariant torus. 

As said in Section 13.31 the torus will be said hyperbolic in that case. This is a somewhat improper 
terminology. Indeed, when studying lower-dimensional tori one usually considers Hamiltonians of the 
form [53E2EE1E3] 

W(a, q, A, p) = X -A 2 + ip 2 + ]T \%q\ + ef(a, q), (8.6) 

i=l 

(or generalisations of its), with (a, A) E T r x R r and (q,p) £ M. s x M. s . Thus, for e = the coordinates a 
freely rotates with some rotation vector u> = (u>i, . . . ,uj r ), while the coordinates (q,p) moves around an 
equilibrium point, which is elliptic if Ai > for alH = 1, . . . , s and hyperbolic if A.; < for all i = 1, . . . , s. 
The numbers u>x, . . . , u> r are called the proper frequencies, while the numbers Ai , . . . , A s are called the 
normal frequencies. 

Then one can study the problem of persistence of lower-dimensional tori under perturbation, that is 
for e ^ 0. The Hamiltonian (|3.18|) can be interpreted as a Hamiltonian of the form (|8.6p with = 0(e). 
In that case one usually say that the lower-dimensional tori are parabolic. However, we can also interpret 
the persisting tori, in the case ai > and e < 0, as degenerate hyperbolic tori, where "degenerate" refers 
to the fact that the normal frequencies vanish at e = 0. 

Of course if the matrix dpfo(f3 ) is negative definite, the same result of persistence of hyperbolic 
invariant tori holds for e > 0. The case of indefinite (i.e. neither positive nor negative defined) matrices 
will be considered at the end of Section [TO] 

9 Summation of the divergent series — dissipative systems 

The discussion of the model (|3.26p introduced in Section 13.41 proceeds very closely to the case of the 
hyperbolic tori of Sectional In that case n — 1, hence both the propagators and the self-energies are 
scalar. One has, formally, 

M [n] (x,e) = -(iex) 2 + M [n] {x,e), M [n] (0, e) = ae + 0(e 2 ), a := d x g(c ) ^ 0. (9.1) 

If for all n' < n the dressed propagators G^ n l(w • v,e) can still be bounded proportionally to (as 
the undressed ones), the terms 0(e 2 ) are defined by a convergent series, so that 

M [n] (x,e) =M [n] (0,e)+x f dtd x M [n] (tx,e), d x M [n] (x, e) = 0(e 2 ), (9.2) 

Jo 
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and hence 



S (x) -M ln ~ 1] {x,e) = ix- M ln ~ 1] (x,s) = ix (1 + lex) - ae - 0(e 2 ) - 0(e 2 x), 



oj-v, (9.3) 



and hence 



q[u] 



x,e 



< 



S (x)-M [n - 1] (x,e) 



< 



(9.4) 



for e small enough. Actually one can prove that vW™'(0, e) is real for real e [5T], a property which becomes 
essential to deal with the case in which the nondegeneracy condition a ^ is not satisfied - see Section 
below. 

Again the properties (|9.ip and (|9.4p are proved together, by induction on n: for n = they trivially 
hold, and, by assuming that they are satisfied up to n — 1, one sees that the series defining M\ n ^{x,e) 
converge, and hence (|9.1|) can be proved for n; see [32 001 HI] for details. 

One can study the dependence of the response solution on e in the complex domain. Of course one 
expects an obstruction to analyticity along the imaginary axis (so as it happened along the positive real 
axis for the hyperbolic tori). In fact one can prove that the response solutions are analytic in two disks 
tangent to the imaginary axis at the origin - see Figure El — ; of course only the disk to the right is 
physically relevant, as it corresponds to e > and hence to 7 > 0. 

line 





Ree 



Figure 6: Analyticity domain for the response solution. 



An interesting remark is that, while the periodic case (d = p = 1) is trivial in the case of lower- 
dimensional tori, this is no longer true for the model (|3.26p . Indeed, if one takes r = 1 in Section 
13.31 then no small divisors appear, so that the perturbation series is easily proved to be convergent (in 
particular analyticity in e follows in that case). On the contrary if one takes d = m = 1 for the model 
(|3.26|) . one can still have arbitrarily large powers of \u\ because 5i(u> ■ v)/5q(u) ■ v) — iu ■ v. then, it is 
straightforward to see that one can construct trees whose value grows like a factorial - see Appendix 

This means that also in the case of periodic forcing, the response solution to (|3.26p is not analytic in 
s. However, one can prove that such a solution is Borel summable [42, 43J; we recall in Appendix [Dl the 
definition of Borel summability - see also [TH EHl HE] ■ Note that an equation like (|3.26p with periodic 
forcing (and with g(x) — x^ for fi S [1.5, 2.5]) naturally arises in electronic engineering, and is known as 
the varactor equation [7]. 

It is proved in [43j that also in the case of quasi-periodic forcing, the response solution turns out to be 
Borel summable if d = 2 and r = 1 - that is in the case of frequency vectors with components whose ratio 
is an irrational number of constant type [77] . A similar situation is encountered in the case of hyperbolic 
tori: also in that case the function u(u>t, e) is Borel summable if d = 2 and r = 1 [25] . 
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10 Cantorisation — elliptic tori 



Let us come back to the system of Section f3.31 still assuming that the eigenvalues a±, . . . , a s of the matrix 
dlfo( 
JO 



d%fo(0o) are all positive, but taking e > 0. In that cases, already for n = — 1 one has - see (|6.8|) and 



v ed 2 p fo([3 

so that the eigenvalues x 2 — \\ 1 ' (x, e) of the matrix Sq(x) 1 — (x, e) are 



x*-\^\ x ,e) = \ x l> ' (io.2) 

I a; — eat-,. , i = r + 1, . . . , a. 

Hence, for fixed s, we have problems for all i/£Z r such that u> • v is too close to some value i^/c^e. So, 
to give a meaning to G^ (x, e) we must require some further Diophantine conditions, say 

\\u-v\- ^/£5Ti\ >7|H" T ' Vi/eZ r \{0}, (10.3) 

for some Diophantine exponent r' > d. This can be achieved at the price of eliminating some values of 
e. Fixed £o > small enough, the subset of values e S [0, £o] for which all Diophantine conditions 
(|10.3p are satisfied has large Lebesgue measure in [0,£o], in the sense that 

km mCaS(£ -i n[M) = 1, (10.4) 

e^0+ e 

provided r' is chosen larger enough than r, say r' > r + r; see Appendix El The property (110. 4jl can be 
stated by saying that <B'_ 1 has a Lebesgue density point at e — 0. 

For all values e S (S^ the propagators G'°'(x, e) = v E'o(a ; ) (^o(^)l — .M' -1 ^!,^)) -1 can be formally 
defined. At this point, one could hope to iterate the procedure. The main obstacle is that now the 
dressed propagators are no longer bounded proportionally to the undressed one: indeed it may happen 
that much smaller than x 2 . So we have to modify the algorithm. 

For simplicity, let us first reason once more by taking a sharp decomposition as initially done in 
Section [5) Let us also assume, in the discussion below, the self-energies to be well-defined: we shall back 
later to such an issue. 

We say that v ^ is on scale if \u> ■ v\ > 7 and on scale [> 1] otherwise: for v on scale we 
define G^(uj ■ v,e) as in (|6.7p . with n = and Ai^ x \x, e) given in (110. ip . Given v on scale [> 1] 

we say that v is on scale 1 if minj=i d li 10 ' v ) 2 ~ A| ^(^ ' v i £ )\ — (2 _1 7) 2 5 and on scale [> 2] if 
min*=i,...,d l( w ' v ) 2 ~ ' v i e )\ < (2 _1 7) 2 - For v on scale 1 we write G^(x,e) as in (|6.7p , with 

n = 1 and A4^(x, e) written according to (|6.8p . Call A' '(x, e) the eigenvalues of M\ ^ (x, e): given ^ 



on scale [> 2] we say that is on scale 2 if mini = i <j |(cj • v) 2 — A,[°'((jJ ■ v,e)\ > (2 2 7) 2 , and on scale 
[> 3] if miiXj^i,...^ \(u> ■ v) 2 — X, (u> ■ < (2~ 2 7) 2 . And so on: eventually we impose infinitely many 
Diophantine conditions, i.e. 

|w • v \ ~ V l A l nl ( w ' > 2~ (n+1)/2 7M~ T ' Vi/eZ r \{0} (10.5) 

for all z = 1, . . . , d and n > — 1. 

Even if we are successful in imposing the conditions p0.5p . the argument above is still incomplete, 
and it needs a further modification. In order to bound the tree values and the self-energies we need a 
bound on the number of lines of fixed scale, in the spirit of the Siegel-Bryuno lemma. This requires to 
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compare the propagators of the lines entering and exiting clusters which are not self-energy clusters - see 
the discussion at the end of Appendix [Fl - and this leads to further Diophantine conditions, 



>2 -(n+l)/2 7 |^_ iy2 |-r' V^^^eZ^O}, (10.6) 



for all i,j = 1, . . . , d and n > — 1. For any fixed v this would mean to impose the conditions for all V\ 
and vi such that v\ — v-i = v. Unfortunately, these conditions are infinitely many, and for all of them we 
would eliminate intervals of the same size: therefore we would left with a zero measure set. Of course, at 
the step n — —1, there would be no difficulty, since the eigenvalues \\ (x,e) are independent of x - see 
(|10.2[) but already at the step n = problems would arise. 

So, instead of (|10.5p and ()10.6p . we can try to impose the Diophantine conditions 



> 2 -(„+i)/2 7 | !/ |- T ' v^eZ r \{0}, (10.7a) 



u-v\± 



V / Aj^± V / Ip(I) 



> 2 {n+1)/2 1 \v\- T ' VveZ r \{0}, (10.7b) 



for all i, j = 1, . . . , d and n > 0, for suitable numbers A]' (e) independent of v. The advantage of (|10.7p 
with respect to (|10.6[) is that for any n we have to impose that the quantities w ■ v are far enough only 
from a finite number of values, that is d values for (|10.7a[) and < Ad 2 values for (|10.7b[) . 

Thus, already for n = —1 we have to impose, besides the conditions (|10.3p . also the conditions (|10.7bjl . 
This leaves a subset C <£'_±. To prove that the set has still a Lebesgue density point at s = 0, 
we need a lower bound on all the derivatives d(y / ?a7 ± v /eaJ)/de, with i ^ j when the sign minus is 
considered. This is easily obtained if we assume that the eigenvalues ai, . . . ,a s are distinct, i.e. that 
there exists ao > such that |aj — dj\ > a for all 1 < i < j < s; see Appendix [El Of course, this provides 
a further assumption on the function fo((3). 

To deal with the cases n > we define, iteratively, 

r i f 0, i = 1, . . . , r, 

A n] (e) = 1 ' i . T . (10.8) 

In this way we obtain both that the eigenvalues x 1 — a[™' (x, e) are bounded in terms of the quantities 
x 2 — A["'(e) and that the sequences {A[ n '(e)}^_ 1 converge exponentially fast for all i = r + 1, . . . , d, 
that is |A'™'(e) — A^™" 1 '^)! < Kie~ Kl2 " /T e 2 for suitable positive constants K\ and K\\ see Appendix [Fl 
Again, in order to impose the conditions (|10. 7b|l we need a lower bound on the derivatives d(yAj n '(£) ± 



Xj (e))/de, but these can be discussed as in the case n = —1; see Appendix lEl 
The discussion above is correct as far as the self-energies are well-defined - which we have simply 
assumed to be for the moment. For instance, only if this is the case, when we write A[™' (e) = a,i- r £+0(e 2 ) 
for i = r + 1, . . . , d, we can really say that the high order terms are negligible with respect to the liner 
ones. As in the case of maximal and hyperbolic tori, we prove by induction that the self-energies are well 
defined. To this aim, we need bounds on the number of lines on scale n: one can prove that bounds of 
the form ()5.8j) and (17. 2|) still holds, but with r' instead of r - since the Diophantine conditions involve 
the Diophantine exponent r'; see Appendix [FJ Up to this difference, the strategy of the inductive proof 
is exactly as in the previous cases. 

The Diophantine conditions ()10.7a[) and (j!0.7b[) are known as the first Melnikov conditions and second 
Melnikov conditions, respectively. Each condition shrinks further the set of allowed values of e: if £ n -i 
is the set of allowed values found at the step n — 1, then imposing the conditions (j!0.7|) leaves a subset 
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<£„ C <£ n -i- At each step the set of values which are removed has measure proportional to a common 
value times 2~ (n+1 )/ 2 : it was to obtain this exponential factor that a factor 2~( n+1 ^ 2 was introduced in 
(|10.7p . Thus, eventually one is left with a set (Eqo, which is still of large measure. A closer inspection of 
(Boo reveals that €oo is a Cantor set (that is a perfect, nowhere dense set). 

To make the argument above really rigorous one should take a smooth decomposition, such as that 
considered in Section ]E\ Moreover, it turns out to be convenient to use functions A'™' (e) which are 
differentiable (in the sense of Whitney [72]) in e, so that, instead of the minimum of the eigenvalues, one 
should consider a smooth version of it - see Appendix IF1 for details (see also [38 j l45 l 139]). 

The assumptions on dpfo((3 ) can be weakened by requiring that the eigenvalues a±, . . . , a s are such 
that a, for all i and a, — cy ^ for all i ^ j. In this case lower- dimensional tori of mixed type can 
be proved to exist 03 [35] . 

11 Stability and uniqueness 

The quasi-periodic solutions describing the maximal tori are linearly stable [3J- An interesting problem 
is that of uniqueness of solutions. In other words, one can wonder whether there are other quasi-periodic 
solutions with the same rotation vector ui as the solution studied in the previous sections. Despite the 
apparent simplicity of the problem, a proof of uniqueness has been given only recently [31j . 

The case of lower-dimensional tori is more difficult. In principle there could be other quasi-periodic 
solutions with the same rotation vector, which either do not admit any perturbation expansion or admit 
a different expansion or, even admitting the same expansion, are different. For instance, the resummed 
expansion (|6.6|) a priori depends on the particular way the multiscale analysis is implemented, and by 
slightly changing the procedure one could obtain a different solution: this would imply infinitely many 
solutions which have the same formal perturbation series. In the case d = 2 and r = 1 all such functions 
coincide, because they are all Borel summable - see the last remark in Appendix|D]- but in general there 
is no reason why this should happen. At the present moment the problem of uniqueness is still open. 

In the case of the dissipative systems of Section 13.41 one expects, on the ground of physical con- 
siderations, the response solution to be either attractive or repulsive (which means attractive for the 
time-reverted dynamics). More precisely, under the further assumption that d x g(co) > the response 
solution is expected to be asymptotically stable. Indeed, this is what happens. 

The proof - very easy - proceeds as follows [I] . The analysis of Section [H shows that there exists a 
response solution Xo(t) — x(urt,e) — cq + 0(e). If we look for solutions of the form x — xq + £: then 
x(t) — > xo(t) as t — > oo (i.e. x is attracting) if and only if — > as t — > oo. The function £ must 
solves the differential equation 



If we define R(£,t) = P(£,x (t))/P(£,co) we have R(0, t) = l + 0(e), so that 1/2 < R(£,t) < 2 uniformly 
in t and £, for e and £ small enough. Then we can rescale time and variables by setting 



a+rt + p(t,x (t)) = o, 



P($, x) = g(x + - g(x) = d x g(x) £ + 0(£ 2 ), 



(11.1) 



which can be rewritten as a system of first order equations 




(11.2) 





y(t) = y/R(£(t),t)w(T(t)), 



(11.3) 
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which transforms the system (|11.2p into 



(11.4) 

w' = -7(1;, t) w - P(v, c ), 



with the prime denoting differentiation with respect to time r and 

^■'M^m)- (1L5 > 

If we neglect the friction term j(v, t) w in (|11.4p we obtain an autonomous system with constant of motion 

H(v,w) = ^w 2 + J dv'P(v',c ) = ^w 2 + ^d x g(c )v 2 + O(v 3 ), (11.6) 

so that the origin is a stable equilibrium point. Moreover j(v, t) > for e small enough (recall that 
7 = 1/e), in a neighbourhood U of the origin. Hence we can apply Barbashin-Krasokvsky's theorem 
[B"2"] (or Lasalle's invariance principle [53] ) to conclude that the origin is asymptotically stable and U is 
contained in its basin of attraction. 

In some cases, for instance if g(x) = x 2p+1 , p £ N, and /o / (so that d x g(co) > 0), the response 
solution turns out to be a global attractor [3], but of course in general it is only locally attracting. 

The problem of uniqueness, mentioned about the lower-dimensional tori, can be addressed also as 
to such response solutions. Of course, the local attractiveness of the solution implies local uniqueness. 
In other words, the response solution x{uj_t, e) is the only quasi-periodic solution which reduces to cq as 
0. 



12 Generalisations 

In this last section, we review some possible directions one can follow to generalise the results described 
in the previous sections. Some of the these generalisations are discussed in the literature, other are still 
open problems. 

12.1 Weaker Diophantine conditions 

Instead of the standard Diophantine condition (|1.5p one can consider weaker conditions, such as the 
Bryuno condition [T^]: a vector u> is said to satisfy the Bryuno condition if B(u>) < oo, where 

oo 

6(cj) = V— log — p-r-, o n (w)=inf{|w-i/| : ^ e Z d such that < \v\ < 2"}. (12.1) 
^-J 2™ a n {(j) 

All the results of the previous sections can be extended to rotation vectors satisfying the Bryuno condition: 
see |39j for maximal and lower-dimensional tori, [5] for the standard map, and |40j for dissipative systems. 

For d — 2 one can write lo = (oji,^) = (1, a)o;xj where a = o^/^i is the rotation number. One can 
define the Bryuno function 

oo ^ 

where {g n }nez are the denominators of the best approximants of a |77j . Then the function B(co) is 
equivalent to the Bryuno function B(a), in the sense that one has C" 1 < B(uS)/B{a) < C for a universal 
constant C l39l . 



25 



An open problem is whether such a condition can be further weakened. For d — 2 the Bryuno 
condition is optimal: indeed, in the case of the standard map Davie 27J proved that if the rotation 
number oj g R does not satisfy the Bryuno condition then there is no invariant curve with that rotation 
number to. Such a result can be even strengthened by saying that the radius of convergence of the 
conjugating function and the function B(u>) are such that 



for suitable universal constants C\ and C2; see [371 E| f° r a proof of the last statement. The proof of the 
lower bound in (112. 3j) relies on deeper cancellations than those discussed in Section [7J see [9] for details. 

Another Diophantinc condition considered in the literature is the so-called Riissmann condition [75, 
[751 [75] , which has a somewhat intricate definition if compared to (|12.ip . For d = 2 such a condition is 
equivalent to the Bryuno condition [76] . 

12.2 Degenerate perturbations 

The dissipative systems introduced in Section [3.4l have been considered under the nondegeneracy condition 
that d x g(co) if g(co) — fo- Such a condition can be removed, and the existence of a response 
solution can be proved under the only condition that there exists a zero cq of odd order to the equation 
g(x) — fo = 0; see [10], where it is also proved that there is no response solution reducing to cq as e — > 
if Co is a zero of even order. 

Also in the case of lower-dimensional tori one can think of relaxing the nondegeneracy condition that 
the matrix dpfa((3 ) be nonsingular. In full generality, this case is very hard. The case s = 1 is already 
nontrivial. In that case it has be proved that at least one lower-dimensional torus always persists [20] . 

A first difference with respect to the nondegenerate case considered in Section 13.31 is that a formal 
power series in s does not exist any longer. In [36j it is shown that, at least in some cases, a fractional 
power series in e can be envisaged. The situation is somewhat reminiscent of what happens in Melnikov 
theory [661 1221 [5"4"j . when the subharmonic Melnikov function has degenerate zeroes of odd order: in that 
case subharmonic solutions exist and are analytic in a fractional power of e (Puiseux series [741 118] ) 
- cf. [33] [25]; see also [TJ [70] for a similar situation in the case of limit cycles. In the case of lower- 
dimensional tori, the fractional power series in e do not converge, but they can be resummed in order to 
give well-defined functions - see |36] for a complete discussion. 

12.3 More general systems 

One can also consider ordinary differential equations more general than , say of the form 



with Fq real analytic. In that case one still assumes that the unperturbed equation Dq(u) = Fq(u) admits 
a quasi-periodic solution uo(ujt). 

The most general formulation of KAM theorem is within this class - see the Hamiltonian p.2j) . The 
analysis performed in the previous sections for the simplified Hamiltonians (|3.1|) and (|3. 18|) can be ex- 
tended to deal with these Hamiltonians; we refer to [571 133 f° r details. If the perturbation depends 
explicitly on the action variables, then the formal solubility of the Hamilton equations relies on a non- 
degeneracy condition of the unperturbed Hamiltonian, such as the invertibility of the matrix d^Ho(A) 
(anisochronous condition). However, the KAM theorem can be extended also to isochronous systems 
with Hg(A) = u> ■ A, by assuming a nondegeneracy condition on the perturbation, for instance that the 
matrix 



(12.3) 



D E u = F (u) + eF{u,wt), 



(12.4) 




(12.5) 
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is invertible [53] . A proof along the lines of the previous sections passes through the so-called translated 
torus theorem [69] (also knows as theorem of the modifying terms or theorem of the counterterms) , which 
says that, for any analytic function /: T™ x R n and any Diophantine vector u> E R™, there exists a vector 
li(e,u)) analytic in e, such that the equations 

ia = ui + ed A f(at, A) + fi(e,u), 

\A = -ed a f(ot,A), [ ' ' 

admit a quasi-periodic solution with rotation vector u> which is analytic in e. A proof of such a theorem by 
using the tree formalism can be found in [5] . The theorem of the translated torus can be also formulated 
in Cartesian coordinates; in that case the cancellation mechanisms leading to the convergence of the scries 
work in a rather different way [24j . 

In Section [3~Tl we have considered only analytic Hamiltonians. A more general formulation of the KAM 
theorem requires only finite smoothness 68, 72 . In certain cases, the tree formalism can be extended 
to nonanalytic systems, such as some quasi-integrable systems of the form (|3.3p with / in a class of C p 
functions for some finite p \12\ I13j. However, up to exceptional cases, the method described here seems 
to be intrinsically suited in cases in which the vector fields are analytic - from a physical point of view 
this a quite reasonable assumption. The reason is that in order to exploit the expansion (|2.3p . we need 
that F be infinitely many times differentiable and we need a bound on the derivatives. It is a remarkable 
property that, as shown in Sections [S] and [HI the perturbation series can be given a meaning also in cases 
where the solutions are not analytic in the perturbation parameter. 

Equations of the form (|12.4|) also arise in problems of electronic engineering and theory of circuits, 
usually with periodic forcing. Such systems are resistive and hence intrinsically dissipative. As examples 
one can consider the saturating inductor circuit, described by the equation 

a + v 2 

G(x)x + f3x + e-yx = ef(uit), G(v = a > 1, p > 0, 7 > 0, (12.7) 

1 + tr 

and the resonant injection-locked frequency divider, described by the equation 



x' = ay + 0x (l - x 2 ) + ex (l - x 2 ) f(uJt), a > j3 > 1, 



y' = -x- y, 



(12. 



with m = 1 (i.e. periodic forcing) in both cases. For both equations the dynamics at e — is known: the 
first system (|12.7|) admits a constant of motion (although it is not Hamiltonian) , while the second one 
has a globally attracting limit cycle. By following the same approach as described in Sections and [H 
one can study for e ^ the existence of periodic solutions whose period is rational with respect to the 
period of the forcing - with the major simplification with respect to the previous analysis that no small 
divisors appear. More precisely one can study the existence of subharmonic solutions for the equation 
(112. 71) , and the frequency locking phenomenon for the equation (|12.8p . We refer to the literature [3 [6] 
for details and results. 

12.4 Partial differential equations 

Finally, the analysis developed so far for ordinary differential equations, can be extended to partial 
differential equations, such as the nonlinear wave equation 

d u u — d xx u + [iu = u 3 , x£[0,7r], [i > 0, (12.9) 

and the nonlinear Schrddinger equation 

id t u — d xx u + fiu = \u\ 2 u, x6[0,7r], [i > 0, (12.10) 
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with periodic or Dirichlet boundary conditions. There exists a very wide literature about periodic and 
quasi-periodic small amplitude solutions to nonlinear one-dimensional partial differential equations such 
as (|12.9j) and (|12.10p . starting from the seminal work by Kuksin, Craig and Wayne [B3J [SHI [2S] • Recently 
results have been obtained also in higher space dimension [HI [TBI [30] , that is for x G [0, tt} d , D > 1, with 
periodic boundary conditions. 

By using the tree formalism, small amplitude periodic solutions have been proved to exist, in dimension 
D = 1, both in the nonresonant case - /x in a suitable Cantor set - and in the resonant case - fi = 
[501 1ST] . Results have been obtained also in the higher space dimensional case D > 1 [S5] • We refer to 
the original papers for a precise formulation of the results and the proofs. 

A Proof of the Siegel-Bryuno lemma 

The bound ([5~E]) follows from the fact that if 01* (0) ^ then 01* (0) < E(n, 0) := cK(0)2- n / T - 1, with 
c = 2 2+1 / T . The last bound can be proved by induction on the order k{0) as follows. Given a tree let 
£o be its root line, let £i,. . . ,£ s , s > 0, be the lines on scales > n which are the closest to £q, and let 
6\ , . . . , S the trees with root lines t\ , . . . , l s , respectively - cf . Figure [Tj By construction all lines £ in 
the subgraph T have scales ri£ < n, so that if ng > n then T is necessarily a cluster. Moreover, all trees 
9i,. . . ,0 S have orders strictly less than k(0), so that, by the inductive hypothesis, for each i = 1, . . . , s 
one has either 0^(0*) < E(n,9i) or 9t*(0i) = 0. 




Figure 7: Construction for the proof of the Siegel-Bryuno lemma. 

If either £q is not on scale n or it is on scale n but exits a self-energy cluster then 9T* (0) = 91* (0i) + 
. . . + 91* (0 S ) and the bound 01* (0) < E(n, 0) follows by the inductive hypothesis. If £q does not exit a 
self-energy cluster and ng = n then 9t*(0) = 1 + 91* (0i) + . . . + 01* (0 S ), and the lines £i,. ..,£ s enter 
a cluster T with K(T) = K{8) - {K{0t) + ... + K(0 S )). If s > 2 the bound 9t*(0) < B(n,0) follows 
once more by the inductive hypothesis. If s = then 01* (0) = 1; on the other hand for £q to be on 
scale ni = n one must have \u ■ Pe \ < 2~ n+1 7, which, by the Diophantine condition (jl.8p . implies 
K{9) > \ v to\ > 2(" -1 )/ r , hence E{n,0) > 1. If s = 1 call i^i and ^2 the momenta of the lines £0 and ^1, 
respectively (in particular ^1 = t^ ). By construction T cannot be a self-energy cluster, hence / ^. 
Thus, by the Diophantine condition (|1.5|) . one has 

2-' l+2 7 > \uj ■ Vl \ + \u ■ va\ > \u ■ {vx - v 2 )\ > 1 —— , (A.l) 

because ri£ = n and ni 1 > n, and hence 

#(T)> ^ |^|>|^-^|>2("- 2 )^, (A.2) 

v£N{T) 

hence T must contain "many nodes". In particular, one finds also in this case 01* (0) = 1 + 01* (0i) < 
l + £(n,0i) < l + £;(n,0)-cis:(T)2-"/ T < E(n,6), where we have used that cK(T)2- n / T > 1 by (TO]) , 
provided c = 2 2+1 / T . 
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The argument above shows that small divisors can accumulate only by allowing self-energy clusters. 
That accumulation really occurs is shown by the example in Figure [SJ where a tree 9 of order fc containing 
a chain of p self-energy clusters is depicted. Assume for simplicity that fc/3 is an integer: then if p = fc/3 
the subtree 9\ with root line £ is of order fc/3. If the line £ entering the rightmost self-energy cluster T p 
has momentum v, also the lines exiting the p self-energy clusters have the same momentum v. Suppose 
that \v\ « iVfc/3 and \u> ■ v\ « 7/H 7 " (this is certainly possible for some v). Then the value of the tree 9 
grows like a\(k\) a ' 2 , for some constants a\ and a 2 : a bound of this kind prevents the convergence of the 
perturbation series. 




Figure 8: Example of tree with accumulation of small divisors. 



B Siegel-Bryuno lemma for self-energies 

We first prove (|7.3p . Call Vt the path connecting the entering and exiting lines of T. If T G TZ n then 
T contains at least a line on scale > n. If there is one line £ (f. Vt on scale > n, one has K(T) > \vi\ 
and 7M~ T < \u ■ v t \ < 2" n+1 7, so that K{T) > 2 < - n ~ 1 ^ T . Otherwise, let £ be the line £ G V T on 
scale > n which is closest to the line l\ entering T. Call T the subgraph of T consisting of all lines and 
nodes of T preceding £. By construction, all lines in T have scale < n, and hence vi ^ v>i t , otherwise T 
would be a self-energy cluster. Therefore one has j\vt — vg 1 \~ T < \ui ■ {vi — < 2~ n+2 j, which yields 
K(T)> \u e -u tl \ >2(™- 2 )/^. 

To prove (|7.2p one considers a more general class of graphs. We say that T is a graph of class S(n, n') 
if it has one entering line and one exiting line both on scale > n' and all the lines contained in T are on 
scale < n. Define K(T) and 9l„(T) as done in Section [7] for the self-energy clusters. We want to prove 
the bound 9t*,(T) < max{2i^(r)2( 3 "")/ r - 1,0}. Let V T the path connecting the entering and exiting 
lines of T, and let N(Vt) the set of nodes connected by lines of Vt- If all the lines along Vt have scale 
< n', then 01„'(T) = OT^^i) + . . . + 91„<(# m ), where 6%, . . . , 9 m are the trees contained in T which have 
the root in a node v € N(Vt)- In that case the bound follows from (|5.8|) . If there exists a line £ G Vt 
on scale > n', call T t and T 2 the subgraphs of T such that L(T) = {£} U L{T X ) U L(T 2 ). Both T a and 
Ti are of class S(n,n'), so that, in the case in which both T\ and T 2 contain lines on scale > n', by the 
inductive hypothesis one finds 0^/ (T) < 1 + ^'(Ti) + DT„'(T 2 ) < 2K(T)2 (n - 3 ^ r - 1. If T a contains no 
line on scale > n' then one realises that one must have K(Ti) > 2 ( ™~ 2 )/ T , and the same holds for T 2 , so 
that the bound follows also in these cases. Finally, (17. 2j) follows from the previous bound by noting that 
a self-energy cluster T G is a graph of class <S(n, n') for all n < n'. 

C Accumulation of small divisors for dissipative systems 

We want to construct for the model of Section 13.41 a tree 9 whose value Val(#) grows like a factorial. 
Let 9 be the tree with fc nodes v\, . . . ,Vk, such that s Vi = k Vi = 1 and /9„ 4 = for all i = 1, . . . , k — 1, 
while s„,. = and = fc^ = 1. Let z^,. = ^ such that \u> ■ v\ w 7/|^| T . The value of the labels 
for i = 1, . . . , k — 1 implies that v Vi = for i = l,...,fc — 1, and hence all the lines have the same 
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momentum v. Then one has Val(#) = (iu ■ v) 2 ^ k ^ f v (iu> ■ v) k = (iu> • v) k 2 f v , which can be bounded by 
(k - 2)!(2/£) fc S e-«M/ 2 for large k. 

D Borel summability 

Let /(e) = Y^Li a n£ n a formal power series (which means that the sequence {a n }^ =1 is well-defined). 
We say that /(e) is Borel summable if 

1. B(p) := Ylm=i a n p n /nl converges in some circle \p\ < 5, 

2. B(p) has an analytic continuation to a neighbourhood of the positive real axis, and 

3. g(e) = J °° e~ p l e B(jp) dp converges for some e > 0. 

Then the function B{p) is called the Borel transform of /(e), and g(e) is the Borel sum of /(e). Moreover 
if the integral defining g(e) converges for some eo > then it converges in the circle Ree -1 > Ree^ 1 . 

A function which admits the formal power series expansion /(e) is called Borel summable if /(e) is 
Borel summable; in that case the function equals the Borel sum g(e). 

A remarkable property of Borel summable functions is that if two functions /(e) and g{e) are both 
Borel summable and admit the same power series expansion, then the two functions coincide. 



E Excluded values of the perturbation parameter 



Set a = min{<2i, . . . , a s } and A = max{ai, . . . , a s }. In order to impose the Diophantinc conditions (|10.3p 
we have to exclude all values of e G [0, eo] such that | \uj ■ u\ — y/ea~i\ < 7H~ r for some i = 1, . . . , s 
and some v ^ 0. Of course, we can confine ourselves to the vectors v G 27 such that \v\ > uiq := 
(j/4:y/eoA)~ 1 / T , because one has \u ■ v\ > 4\/eqA and hence ■ v\ — ^/eal\ > \uj ■ v\/2 > r y\v\~ T /2 
for \v\ < rriQ. For all \v\ > mo we can introduce an interpolation parameter t G [—1,1] by setting 
\u> ■ v\ — y/ e(t, v) a,i = t^\v\~ T , so that 



< 7 2 v /i(t^J < 



7 2^ 



(E.l) 



for all e(t, v) G <B'_i- Therefore we have to exclude a set (£'_ 1 C [0, eo] of measure 



meas((£'_ 1 ) = [ de < f dt 



f|>mo 



dt 



< 



E ■ 



7 4 v^o 



< C 7 



(i-'-r)/' 



CO 

a 



(E.2) 



for some universal constant C . Hence meas(€'_i) is much smaller than eo if eo is small and t' > r + r. 

To impose the Diophantine conditions (I10.7a[) one can reason in the same way. One uses that A'™' (e) = 
a. ( _ r e + 0(e 2 ) for i = r + 1, . . . , d (see Appendix[F|, which yields |A| n '(e)| < 2a{- r s and |dA-"'(e)/de| > 
a.;_ r /2; here and henceforth the derivative is in the sense of Whitney. Again we have to consider only 
the values i/eZ r such that \i>\ > rriQ. We define €' n as the set of values e G [0, eo] which do not satisfy 

(j!0.7ap and for all \v\ > m and t G [-1, 1] we write \uj ■ v\ - \J \ [ " ] (e(t, v)) = 1 2- { - n+1 ^ 2 1 \v\- T ' . Then 



dt<^ 



< 2-^ n+1 ^ 2 — 



4y/2e 



s/ 7 ' 



4^ 



(E.3) 
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for all e(t, v) £ <£' n , and hence 



3«) < C2- (n+1)/2 7 



(r 7 — r) /t 



(E.4) 



for some universal constant C. Therefore one has meas(l£^) = 2 (™ +1 )/ 2 o(£o)i provided r' > r + r. 

To impose the Diophantine conditions (|10.7b[) . the only difference is that we need a lower bound on 

the derivatives d(\J (e) ± \J (e)) / de, r + l<i,j<d, with i ^ j when the sign minus is taken. One 
easily realises that the conditions with the sign plus do not present any further difficulty with respect to 
the first Melnikov conditions. Moreover, for e small enough and r + l<i^j<d one has 



de 



> 



8VAe 



a := mm \a k - a h \ 

l<k=£h<s 



To deduce (|E.5[) one uses that for i = r + 1, . . . , d. one has X\ 1 (e) = ai- r e and A^ 1 (e) = a,_ r 
n>0. By defining l£" as the set of values e £ [0, £q] which do not satisfy (110. 7b|) . we obtain 



meas(0 < C2^ n+1 ^ 2 1 



(t'-t)It 



V 



a 



(E.5) 

:+0(e 2 ), 

(E.6) 



for some universal constant C, so that once more meas(l£") = 2 (" +1 )/ 2 o(eo) for r' > r + r. The sets Gv, 
in Section [TO] are defined as <£„ = <£' U <£". 



F Multiscale analysis for elliptic tori 

To extend the multiscale analysis to the case of elliptic tori, we slightly change the recursive definition of 
propagators and self-energies. We set A^ 1 ^(x,e) = x 2 and 

/ d _\ ~ 1/2 

& [n K*,e)=(~f2(x 2 -i l - 1] (e)) j , n>0, (F.l) 

and define 

n n—1 

Sn(s,e) = IIxpCA^^e)), e ) = ^(A^ (x, e)) \{ x^A^ (x , e)) , (F.2) 

with the functions \ n and defined as in Section [5j with the only difference that in (|5.2|) 7 and 7/2 
are replaced with 7 2 and 7 2 /4, respectively. 

In terms of the quantities (|F.1|) and (|F.2|) , the propagators <3e = G^ ni! \uj ■ vi,e) are defined iteratively 

as 

G [n \x,e) = <S> n {x,e) (d {x)t - M [ri " 1] (x,e)) 1 (F.3a) 
M [n] (x,e) = M [n - 1] (x,e) + E n (x,e)M^{x,e) 1 M^(x,e) = J2 £ KT) Vt{x). (F.3b) 

Finally the numbers a'™'(e) are defined according to (|10.8[) . where A[ (a;, e) are the eigenvalues of the 
matrix M^ n \x,e). 



31 



If a line £ is on scale n and ^ 0, then one has 



mm 

Ki<d 



mm 

Ki<d 



(wi/) 2 -Af ] (e) <2- 2p 7 2 , 0<p<n-2, 



> 



Therefore, setting u> ■ v = x, if x > one has 



< max 



d x X l r 1] (x,e) 



where 9 X A-™ e) = 0(e 2 ) and 



< ia!"- 21 ( £ )-^ 2 i < 2- 2 (™- 2 ) 7 2 

Jn-2] 



iVAr^)+*i 



5rt 



and hence 



*i n - X W)-A! n - 1] (e) 



< Ce 2 



-2)1 



for some positive constant C. Therefore (|F.4bp and (|F.7|1 imply 



(x,e) 



> 



> 



x 2 -X 



n-1] 



00 



(F.4a) 
(F.4b) 

(F.5) 

(F.6) 
(F.7) 
(F.8) 



The case x < is discussed in the same way noting that a[™ x,e) = A," because of (17.11) 



Therefore the eigenvalues x 2 — X\ (x, s) can be bounded from below by half the quantities x 2 — X\ (e)- 

e 2 mentioned after (|10.8p follows from the expression 



The property |Af J (e) - Aj n 1] (e)| < Qe 



-Kl2 



i/r' 



(|F.3bp for A^l"l(x,e) — M\ n ^(x,e), the bound (|7.5p for the values of the self-energy clusters, and the 



bound J2 V 



GJV(T) 



v„| > c"2"/ r ' which holds for any T G ft n . 



Finally we want to show that the bounds 

m* n (6) <c2- n/T 'K{9), m* n ,(T)<c2- n ' /T 'K{T), TeK n , n'<n, 



(F.9) 



hold with the multiscale analysis described above. One proceed as in Appendices O and [Bj with the 
following changes. If the propagator of a line with momentum v and scale n is non-zero, then (|F.4jl imply 
\u\ > 2(™~ 2 )/ r . When discussing the analogous of the case s = 1 in Appendix |A"1 then (|A.1|) must be 
replaced with 



2-"+ 3 7 > | cj ■ v x + a x J A["- 2] (e) | + | u> ■ v 2 + <j 2 J ^ 2] (e) 



> \ui-(ui-v 2 ) + <y^^- 2 \e) - <r^X [ ;- 2] (e) \ > 



v\ - v 2 \ 



(F.10) 



where the signs 01,(72 £ {±} and the labels i, j £ {1, . . . , d} are such that the first inequality is satisfied. 
Analogously one discussed the case of the self-energy clusters. 

In particular (|F.10|) explains why the second Melnikov conditions are necessary. Of course, if we had 
used directly the eigenvalues A! n '(ai, e), instead of the quantities A[ n '(e), we would have require (|10.6[) 
instead of (|10.7b|) . We have already seen in Section \W\ why this was not allowed. 



Acknowledgments. I thank L. Corsi for her very careful comments on the manuscript. 
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